Filling in an entity within an image

ABSTRACT

A computer-implemented method according to one embodiment includes identifying an entity to be filled in within an image, determining a three-dimensional (3D) model for the entity, and filling in the entity within the image, utilizing the 3D model.

BACKGROUND

The present invention relates to image and video correction, and morespecifically, this invention relates to identifying and repairingundesired elements within a displayed image or video.

Video and image editing tools are commonly used to edit and repairimages and video. However, current editing tools perform poorly whenused to repair undesired elements having irregular structures (e.g.,cars, people, etc.) within associated images and video.

SUMMARY

A computer-implemented method according to one embodiment includesidentifying an entity to be filled in within an image, determining athree-dimensional (3D) model for the entity, and filling in the entitywithin the image, utilizing the 3D model.

According to another embodiment, a computer program product for fillingin an entity within an image includes a computer readable storage mediumhaving program instructions embodied therewith, where the computerreadable storage medium is not a transitory signal per se, and where theprogram instructions are executable by a processor to cause theprocessor to perform a method comprising identifying the entity to befilled in within the image, utilizing the processor, determining athree-dimensional (3D) model for the entity, utilizing the processor,and filling in the entity within the image, utilizing the 3D model andthe processor.

A system according to another embodiment includes a processor, and logicintegrated with the processor, executable by the processor, orintegrated with and executable by the processor, where the logic isconfigured to identify an entity to be filled in within an image,determine a three-dimensional (3D) model for the entity, and fill in theentity within the image, utilizing the 3D model.

Other aspects and embodiments of the present invention will becomeapparent from the following detailed description, which, when taken inconjunction with the drawings, illustrate by way of example theprinciples of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts a cloud computing node according to an embodiment of thepresent invention.

FIG. 2 depicts a cloud computing environment according to an embodimentof the present invention.

FIG. 3 depicts abstraction model layers according to an embodiment ofthe present invention.

FIG. 4 illustrates a method for filling in an entity within an image, inaccordance with one embodiment.

FIG. 5 illustrates a method for filling in an entity within a video, inaccordance with one embodiment.

FIG. 6 illustrates a method for improving content-aware image fillingusing object recognition and 3D representations, in accordance with oneembodiment.

DETAILED DESCRIPTION

The following description discloses several preferred embodiments ofsystems, methods and computer program products for filling in an entitywithin an image. Various embodiments provide a method to identify anentity to be filled in within an image, determine a 3D model associatedwith the entity, and fill in the entity, utilizing the 3D model.

The following description is made for the purpose of illustrating thegeneral principles of the present invention and is not meant to limitthe inventive concepts claimed herein. Further, particular featuresdescribed herein can be used in combination with other describedfeatures in each of the various possible combinations and permutations.

Unless otherwise specifically defined herein, all terms are to be giventheir broadest possible interpretation including meanings implied fromthe specification as well as meanings understood by those skilled in theart and/or as defined in dictionaries, treatises, etc.

It must also be noted that, as used in the specification and theappended claims, the singular forms “a,” “an” and “the” include pluralreferents unless otherwise specified. It will be further understood thatthe terms “includes” and/or “comprising,” when used in thisspecification, specify the presence of stated features, integers, steps,operations, elements, and/or components, but do not preclude thepresence or addition of one or more other features, integers, steps,operations, elements, components, and/or groups thereof.

The following description discloses several preferred embodiments ofsystems, methods and computer program products for filling in an entitywithin an image.

In one general embodiment, a computer-implemented method includesidentifying an entity to be filled in within an image, determining athree-dimensional (3D) model for the entity, and filling in the entitywithin the image, utilizing the 3D model.

In another general embodiment, a computer program product for filling inan entity within an image includes a computer readable storage mediumhaving program instructions embodied therewith, where the computerreadable storage medium is not a transitory signal per se, and where theprogram instructions are executable by a processor to cause theprocessor to perform a method comprising identifying the entity to befilled in within the image, utilizing the processor, determining athree-dimensional (3D) model for the entity, utilizing the processor,and filling in the entity within the image, utilizing the 3D model andthe processor.

In another general embodiment, a system includes a processor, and logicintegrated with the processor, executable by the processor, orintegrated with and executable by the processor, where the logic isconfigured to identify an entity to be filled in within an image,determine a three-dimensional (3D) model for the entity, and fill in theentity within the image, utilizing the 3D model.

It is understood in advance that although this disclosure includes adetailed description on cloud computing, implementation of the teachingsrecited herein are not limited to a cloud computing environment. Rather,embodiments of the present invention are capable of being implemented inconjunction with any other type of computing environment now known orlater developed.

Cloud computing is a model of service delivery for enabling convenient,on-demand network access to a shared pool of configurable computingresources (e.g. networks, network bandwidth, servers, processing,memory, storage, applications, virtual machines, and services) that canbe rapidly provisioned and released with minimal management effort orinteraction with a provider of the service. This cloud model may includeat least five characteristics, at least three service models, and atleast four deployment models.

Characteristics are as follows:

On-demand self-service: a cloud consumer can unilaterally provisioncomputing capabilities, such as server time and network storage, asneeded automatically without requiring human interaction with theservice's provider.

Broad network access: capabilities are available over a network andaccessed through standard mechanisms that promote use by heterogeneousthin or thick client platforms (e.g., mobile phones, laptops, and PDAs).

Resource pooling: the provider's computing resources are pooled to servemultiple consumers using a multi-tenant model, with different physicaland virtual resources dynamically assigned and reassigned according todemand. There is a sense of location independence in that the consumergenerally has no control or knowledge over the exact location of theprovided resources but may be able to specify location at a higher levelof abstraction (e.g., country, state, or datacenter).

Rapid elasticity: capabilities can be rapidly and elasticallyprovisioned, in some cases automatically, to quickly scale out andrapidly released to quickly scale in. To the consumer, the capabilitiesavailable for provisioning often appear to be unlimited and can bepurchased in any quantity at any time.

Measured service: cloud systems automatically control and optimizeresource use by leveraging a metering capability at some level ofabstraction appropriate to the type of service (e.g., storage,processing, bandwidth, and active user accounts). Resource usage can bemonitored, controlled, and reported providing transparency for both theprovider and consumer of the utilized service.

Service Models are as follows:

Software as a Service (SaaS): the capability provided to the consumer isto use the provider's applications running on a cloud infrastructure.The applications are accessible from various client devices through athin client interface such as a web browser (e.g., web-based e-mail).The consumer does not manage or control the underlying cloudinfrastructure including network, servers, operating systems, storage,or even individual application capabilities, with the possible exceptionof limited user-specific application configuration settings.

Platform as a Service (PaaS): the capability provided to the consumer isto deploy onto the cloud infrastructure consumer-created or acquiredapplications created using programming languages and tools supported bythe provider. The consumer does not manage or control the underlyingcloud infrastructure including networks, servers, operating systems, orstorage, but has control over the deployed applications and possiblyapplication hosting environment configurations.

Infrastructure as a Service (IaaS): the capability provided to theconsumer is to provision processing, storage, networks, and otherfundamental computing resources where the consumer is able to deploy andrun arbitrary software, which can include operating systems andapplications. The consumer does not manage or control the underlyingcloud infrastructure but has control over operating systems, storage,deployed applications, and possibly limited control of select networkingcomponents (e.g., host firewalls).

Deployment Models are as follows:

Private cloud: the cloud infrastructure is operated solely for anorganization. It may be managed by the organization or a third party andmay exist on-premises or off-premises.

Community cloud: the cloud infrastructure is shared by severalorganizations and supports a specific community that has shared concerns(e.g., mission, security requirements, policy, and complianceconsiderations). It may be managed by the organizations or a third partyand may exist on-premises or off-premises.

Public cloud: the cloud infrastructure is made available to the generalpublic or a large industry group and is owned by an organization sellingcloud services.

Hybrid cloud: the cloud infrastructure is a composition of two or moreclouds (private, community, or public) that remain unique entities butare bound together by standardized or proprietary technology thatenables data and application portability (e.g., cloud bursting forload-balancing between clouds).

A cloud computing environment is service oriented with a focus onstatelessness, low coupling, modularity, and semantic interoperability.At the heart of cloud computing is an infrastructure comprising anetwork of interconnected nodes.

Referring now to FIG. 1, a schematic of an example of a cloud computingnode is shown. Cloud computing node 10 is only one example of a suitablecloud computing node and is not intended to suggest any limitation as tothe scope of use or functionality of embodiments of the inventiondescribed herein. Regardless, cloud computing node 10 is capable ofbeing implemented and/or performing any of the functionality set forthhereinabove.

In cloud computing node 10 there is a computer system/server 12, whichis operational with numerous other general purpose or special purposecomputing system environments or configurations. Examples of well-knowncomputing systems, environments, and/or configurations that may besuitable for use with computer system/server 12 include, but are notlimited to, personal computer systems, server computer systems, thinclients, thick clients, hand-held or laptop devices, multiprocessorsystems, microprocessor-based systems, set top boxes, programmableconsumer electronics, network PCs, minicomputer systems, mainframecomputer systems, and distributed cloud computing environments thatinclude any of the above systems or devices, and the like.

Computer system/server 12 may be described in the general context ofcomputer system-executable instructions, such as program modules, beingexecuted by a computer system. Generally, program modules may includeroutines, programs, objects, components, logic, data structures, and soon that perform particular tasks or implement particular abstract datatypes. Computer system/server 12 may be practiced in distributed cloudcomputing environments where tasks are performed by remote processingdevices that are linked through a communications network. In adistributed cloud computing environment, program modules may be locatedin both local and remote computer system storage media including memorystorage devices.

As shown in FIG. 1, computer system/server 12 in cloud computing node 10is shown in the form of a general-purpose computing device. Thecomponents of computer system/server 12 may include, but are not limitedto, one or more processors or processing units 16, a system memory 28,and a bus 18 that couples various system components including systemmemory 28 to processor 16.

Bus 18 represents one or more of any of several types of bus structures,including a memory bus or memory controller, a peripheral bus, anaccelerated graphics port, and a processor or local bus using any of avariety of bus architectures. By way of example, and not limitation,such architectures include Industry Standard Architecture (ISA) bus,Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, VideoElectronics Standards Association (VESA) local bus, and PeripheralComponent Interconnects (PCI) bus.

Computer system/server 12 typically includes a variety of computersystem readable media. Such media may be any available media that isaccessible by computer system/server 12, and it includes both volatileand non-volatile media, removable and non-removable media.

System memory 28 can include computer system readable media in the formof volatile memory, such as random access memory (RAM) 30 and/or cachememory 32. Computer system/server 12 may further include otherremovable/non-removable, volatile/non-volatile computer system storagemedia. By way of example only, storage system 34 can be provided forreading from and writing to a non-removable, non-volatile magnetic media(not shown and typically called a “hard drive”). Although not shown, amagnetic disk drive for reading from and writing to a removable,non-volatile magnetic disk (e.g., a “floppy disk”), and an optical diskdrive for reading from or writing to a removable, non-volatile opticaldisk such as a CD-ROM, DVD-ROM or other optical media can be provided.In such instances, each can be connected to bus 18 by one or more datamedia interfaces. As will be further depicted and described below,memory 28 may include at least one program product having a set (e.g.,at least one) of program modules that are configured to carry out thefunctions of embodiments of the invention.

Program/utility 40, having a set (at least one) of program modules 42,may be stored in memory 28 by way of example, and not limitation, aswell as an operating system, one or more application programs, otherprogram modules, and program data. Each of the operating system, one ormore application programs, other program modules, and program data orsome combination thereof, may include an implementation of a networkingenvironment. Program modules 42 generally carry out the functions and/ormethodologies of embodiments of the invention as described herein.

Computer system/server 12 may also communicate with one or more externaldevices 14 such as a keyboard, a pointing device, a display 24, etc.;one or more devices that enable a user to interact with computersystem/server 12; and/or any devices (e.g., network card, modem, etc.)that enable computer system/server 12 to communicate with one or moreother computing devices. Such communication can occur via Input/Output(I/O) interfaces 22. Still yet, computer system/server 12 cancommunicate with one or more networks such as a local area network(LAN), a general wide area network (WAN), and/or a public network (e.g.,the Internet) via network adapter 20. As depicted, network adapter 20communicates with the other components of computer system/server 12 viabus 18. It should be understood that although not shown, other hardwareand/or software components could be used in conjunction with computersystem/server 12. Examples, include, but are not limited to: microcode,device drivers, redundant processing units, external disk drive arrays,RAID systems, tape drives, and data archival storage systems, etc.

Referring now to FIG. 2, illustrative cloud computing environment 50 isdepicted. As shown, cloud computing environment 50 includes one or morecloud computing nodes 10 with which local computing devices used bycloud consumers, such as, for example, personal digital assistant (PDA)or cellular telephone 54A, desktop computer 54B, laptop computer 54C,and/or automobile computer system 54N may communicate. Nodes 10 maycommunicate with one another. They may be grouped (not shown) physicallyor virtually, in one or more networks, such as Private, Community,Public, or Hybrid clouds as described hereinabove, or a combinationthereof. This allows cloud computing environment 50 to offerinfrastructure, platforms and/or software as services for which a cloudconsumer does not need to maintain resources on a local computingdevice. It is understood that the types of computing devices 54A-N shownin FIG. 2 are intended to be illustrative only and that computing nodes10 and cloud computing environment 50 can communicate with any type ofcomputerized device over any type of network and/or network addressableconnection (e.g., using a web browser).

Referring now to FIG. 3, a set of functional abstraction layers providedby cloud computing environment 50 (FIG. 2) is shown. It should beunderstood in advance that the components, layers, and functions shownin FIG. 3 are intended to be illustrative only and embodiments of theinvention are not limited thereto. As depicted, the following layers andcorresponding functions are provided:

Hardware and software layer 60 includes hardware and softwarecomponents. Examples of hardware components include: mainframes 61; RISC(Reduced Instruction Set Computer) architecture based servers 62;servers 63; blade servers 64; storage devices 65; and networks andnetworking components 66. In some embodiments, software componentsinclude network application server software 67 and database software 68.

Virtualization layer 70 provides an abstraction layer from which thefollowing examples of virtual entities may be provided: virtual servers71; virtual storage 72; virtual networks 73, including virtual privatenetworks; virtual applications and operating systems 74; and virtualclients 75.

In one example, management layer 80 may provide the functions describedbelow. Resource provisioning 81 provides dynamic procurement ofcomputing resources and other resources that are utilized to performtasks within the cloud computing environment. Metering and Pricing 82provide cost tracking as resources are utilized within the cloudcomputing environment, and billing or invoicing for consumption of theseresources. In one example, these resources may include applicationsoftware licenses. Security provides identity verification for cloudconsumers and tasks, as well as protection for data and other resources.User portal 83 provides access to the cloud computing environment forconsumers and system administrators. Service level management 84provides cloud computing resource allocation and management such thatrequired service levels are met. Service Level Agreement (SLA) planningand fulfillment 85 provide pre-arrangement for, and procurement of,cloud computing resources for which a future requirement is anticipatedin accordance with an SLA.

Workloads layer 90 provides examples of functionality for which thecloud computing environment may be utilized. Examples of workloads andfunctions which may be provided from this layer include: mapping andnavigation 91; software development and lifecycle management 92; virtualclassroom education delivery 93; data analytics processing 94;transaction processing 95; and card stunt as a service (CaaS) 96.

Now referring to FIG. 4, a flowchart of a method 400 for filling in anentity within an image is shown according to one embodiment. The method400 may be performed in accordance with the present invention in any ofthe environments depicted in FIGS. 1-3, among others, in variousembodiments. Of course, more or less operations than those specificallydescribed in FIG. 4 may be included in method 400, as would beunderstood by one of skill in the art upon reading the presentdescriptions.

Each of the steps of the method 400 may be performed by any suitablecomponent of the operating environment. For example, in variousembodiments, the method 400 may be partially or entirely performed byone or more servers, computers, or some other device having one or moreprocessors therein. The processor, e.g., processing circuit(s), chip(s),and/or module(s) implemented in hardware and/or software, and preferablyhaving at least one hardware component may be utilized in any device toperform one or more steps of the method 400. Illustrative processorsinclude, but are not limited to, a central processing unit (CPU), anapplication specific integrated circuit (ASIC), a field programmablegate array (FPGA), etc., combinations thereof, or any other suitablecomputing device known in the art.

As shown in FIG. 4, method 400 may initiate with operation 402, where anentity to be filled in within an image is identified. In one embodiment,the image may include a photograph, a digitally created image, etc. Forexample, the image may be a portion of a video (e.g., a frame of thevideo, etc.), a virtual reality image, etc. In another embodiment, theentity may be identified within an image editing application. In yetanother embodiment, the entity may be identified in real-time duringreplay.

Additionally, in one embodiment, the entity may include an objectincluded within the image. In another embodiment, the entity may beidentified as an object with an irregular structure. For example, theentity may match one or more stored 3D models that are classified asirregular structures. In another example, the 3D models may be storedlocally and/or globally (e.g., shared by a plurality of users). In yetanother example, the entity may include a vehicle shown within theimage, a human or animal body shown within the image, a face shownwithin the image, etc.

Further, in one embodiment, the entity may be identified in response touser feedback. For example, a first attempt may be made to fill in anarea of the image including the entity (e.g., utilizing a tool withinthe image editing application for filling in regular structures), but itmay be detected that the results of the first attempt have been undoneby a user of the application.

In another embodiment, the entity to be filled in may be incomplete, mayhave one or more missing portions, etc. For example, portions of theentity may be missing as shown within the image (such as missing/emptypixels within the entity, etc.). In another example, the one or moremissing/empty portions may result from one or more actions that havebeen performed on the image. For instance, the image may be correctedfor perspective and/or distortion, which may result in the one or moremissing/empty portions within the image that may need to be correctedthrough filling.

Further still, in one embodiment, the entity may be identified inresponse to a selection of a portion of the image by the user. Forexample, the user may select a defined area within the image (e.g., tobe filled in, etc.). In another example, the entity may be locatedwithin the defined area. In another embodiment, the entity may beidentified utilizing one or more image recognition techniques. Forexample, the image recognition may be implemented utilizing one or moreimage recognition algorithms. In another example, the image recognitionmay be implemented utilizing one or more neural networks. In yet anotherexample, the image recognition may be implemented utilizing one or moreobject detection APIs.

Also, in one embodiment, the one or more image recognition techniquesmay utilize local image data to refine results. For example, imagesstored locally on a user's device may be used during image recognitionto assist in identifying the entity. In another embodiment, the resultsof the one or more image recognition techniques may include metadataassociated with the entity. For example, the metadata may include anidentification of the entity (e.g., whether the entity is a person, avehicle, etc.). In another example, the metadata may include aclassification of the entity (e.g., a type of vehicle (boat, car, plane,etc.)). In yet another example, the metadata may include one or morespecific characteristics of the entity. For instance, the metadata mayinclude any branding/labels that are identified as part of the entity(e.g., license plate, model badge, etc.), a type of the entity, etc.

In addition, in one embodiment, additional metadata may be associatedwith the image. For example, the metadata may include a date and/or timethe image was created (e.g., a time/date stamp, etc.). In anotherexample, the metadata may include a geographical location where theimage was created (e.g., a geo tag associated with an image taken by acamera, etc.). In another embodiment, the identifying may be performedwithin an individual computing device and/or at a cloud computingenvironment.

Furthermore, as shown in FIG. 4, method 400 may proceed with operation404, where a three-dimensional (3D) model is determined for the entity.In one embodiment, determining the 3D model may include using edgedetection to determine one or more contours of the entity within theimage. For example, edge detection may be implemented utilizing the oneor more image recognition techniques, utilizing a separate edgedetection application, etc.

Further still, in one embodiment, determining the 3D model may includeretrieving one or more 3D models from a model library. For example, the3D models may be retrieved utilizing the identification of the entity.In another example, the 3D models may be retrieved utilizing metadataassociated with the entity, additional metadata associated with theimage, etc. In another embodiment, the model library may include adatabase of 3D models shared by a plurality of users. For example, themodel library may be locally or remotely based. In another example, themodel library may be contributed to and shared by a plurality ofdifferent users. In yet another example, each model within the librarymay be associated with various metadata describing characteristics ofthe model.

Also, in one embodiment, the 3D models may each include a wireframemodel. In another embodiment, one or more of the 3D models may beassociated with one or more textures. In yet another embodiment,determining the 3D model may include refining the models retrieved fromthe library to determine the 3D model. For example, a plurality ofdifferent models may be returned from the library. In another example,the 3D model may be selected from the plurality of different models,utilizing the additional metadata associated with the image. In thisway, the results may be refined utilizing information associated withthe image, such as time and location information.

Additionally, in one embodiment, the determining may be performed withinan individual computing device and/or at a cloud computing environment.

Further, as shown in FIG. 4, method 400 may proceed with operation 406,where the entity is filled in within the image, utilizing the 3D model.In one embodiment, filling in the entity may include applying atransformation to the 3D model. For example, a transformation may beapplied to the 3D model to manipulate the model so that it matches ashape and viewing angle of the entity. In another example, thetransformation may be applied using one or more image registrationtechniques.

Further still, in one embodiment, filling in the entity may includeusing the transformation to project the 3D model onto a selected areawithin the image that needs to be filled (e.g., one or more missingportions of the entity, etc.). In another embodiment, filling in theentity may include mapping each surface of the 3D model to the image.For example, the 3D model of the entity may be mapped to its 2Drepresentation within the image, utilizing image registration. Inanother example, the surfaces of the 3D model may include one or morecharacteristics of the 3D model (e.g., for a car, the surfaces mayinclude one or more windows, wheels, doors, etc.).

Also, in one embodiment, filling in the entity may include applying atexture to the portions of the surface that need to be filled. Forexample, the texture may be learned from a texture of an adjacentsurface outside of the fill area. In another example, the texture may belearned from other images (e.g., related images within a local or shareddatabase). In yet another example, the texture may be learned from otherentities within the image. In still another example, the texture may beassociated with the 3D model itself.

In addition, in one embodiment, one or more areas around the entity(e.g., areas within the portion of the image selected by the user) mayalso be filled in. For example, these areas may be filled in utilizing atool within the image editing application for filling in regularstructures. In another embodiment, filling in the entity may result in areconstructed entity within a reconstructed image.

Furthermore, in one embodiment, the reconstructed entity within theimage may be validated. For example, image recognition may be performedon the reconstructed entity and/or the image. In another example, afirst confidence score may be obtained as a result of the imagerecognition. In yet another example, the first confidence score for thereconstructed image may be compared to a second confidence scorepreviously obtained for the original image. In still another example, ifthe first confidence score is not higher than the second confidencescore, a different 3D model may be determined for the entity and usedfor filling in the entity.

Further still, in one embodiment, validating the reconstructed entitymay include identifying user feedback. For example, it may be detectedwhether the filling in of the entity has been undone by a user of theapplication. In another embodiment, if the filling has been undone, adifferent 3D model may be determined for the entity and used for fillingin the entity. In yet another embodiment, the filling may be performedwithin an individual computing device and/or at a cloud computingenvironment. Also, in one embodiment, the entity may be identified andfilled in within a plurality of images (e.g., a plurality of frames of avideo, etc.), utilizing one or more of the techniques described above.

In this way, incorrect and/or missing pixels associated with the entitymay be filled in, utilizing a 3D model that is retrieved for the entitybased on an identification of the entity within the image.

Now referring to FIG. 5, a flowchart of a method 500 for filling in anentity within a video is shown according to one embodiment. The method500 may be performed in accordance with the present invention in any ofthe environments depicted in FIGS. 1-3, among others, in variousembodiments. Of course, more or less operations than those specificallydescribed in FIG. 5 may be included in method 500, as would beunderstood by one of skill in the art upon reading the presentdescriptions.

Each of the steps of the method 500 may be performed by any suitablecomponent of the operating environment. For example, in variousembodiments, the method 500 may be partially or entirely performed byone or more servers, computers, or some other device having one or moreprocessors therein. The processor, e.g., processing circuit(s), chip(s),and/or module(s) implemented in hardware and/or software, and preferablyhaving at least one hardware component may be utilized in any device toperform one or more steps of the method 500. Illustrative processorsinclude, but are not limited to, a central processing unit (CPU), anapplication specific integrated circuit (ASIC), a field programmablegate array (FPGA), etc., combinations thereof, or any other suitablecomputing device known in the art.

As shown in FIG. 5, method 500 may initiate with operation 502, where anentity to be filled in is identified within a selected plurality ofvideo frames. In one embodiment, the selected plurality of video framesmay include all or a portion of a video. In another embodiment, theselected plurality of video frames may be selected by a user. In yetanother embodiment, each of the selected plurality of video frames mayinclude a still image.

Additionally, in one embodiment, the entity may be identified within avideo editing application. In another embodiment, the entity may beidentified in real-time during replay of the video. In yet anotherembodiment, the entity may include an object displayed within theselected plurality of video frames. For example, each of the selectedplurality of video frames may display the object from a different angleof view when compared to the other selected plurality of video frames.

Further, in one embodiment, the entity may be included within apredetermined area of the selected plurality of video frames. In anotherembodiment, the predetermined area may be selected by a user. Forexample, the predetermined area may be selected by the user within thevideo editing application. In yet another embodiment, the entity may beidentified utilizing one or more image recognition techniques. Forexample, the one or more image recognition techniques may be appliedindividually to each of the selected plurality of video frames.

Further still, in one embodiment, the entity may be identified fromother video frames of the video. For example, the selected plurality ofvideo frames may include only a portion of the total video frames of avideo, and the other video frames may include video frames that comebefore and/or after the selected plurality of video frames within thevideo. In another example, the other video frames may be compared to theselected plurality of video frames to determine if the entity is locatedin one or more of the other video frames. In yet another example, upondetermining that the entity is located within one or more of the othervideo frames, one or more image recognition techniques may be applied tothe entity identified within the one or more other video frames.

In this way, entity identification may be enhanced, utilizing videoframes appearing earlier or later than the selected plurality of videoframes within the video.

Also, method 500 may proceed with operation 504, where athree-dimensional (3D) model is determined for the entity. In oneembodiment, the edge detection may be used to determine one or morecontours of the entity within the image. In another embodiment, one ormore 3D models may be retrieved from a model library. In yet anotherembodiment, the models retrieved from the library may be refined todetermine the 3D model. For example, the retrieved models may be refinedutilizing information derived from other video frames of the video.

In addition, method 500 may proceed with operation 506, where the entityis filled in within the selected plurality of video frames, utilizingthe 3D model. In one embodiment, a transformation may be applied to the3D model. In another embodiment, the transformation may be used toproject the 3D model onto a selected area within the selected pluralityof video frames that needs to be filled. In yet another embodiment, eachsurface of the 3D model may be mapped to the selected plurality of videoframes.

Furthermore, in one embodiment, a texture may be applied to the portionsof the surface that need to be filled. In another embodiment, thetexture may be learned from information derived from other video framesof the video. In yet another embodiment, one or more areas around theentity may also be filled in. In still another embodiment, the fillingin of the entity and the one or more areas around the entity may resultin a reconstructed entity within reconstructed video frames. In anotherembodiment, the reconstructed entity within the reconstructed videoframes may be validated.

Now referring to FIG. 6, a flowchart of a method 600 for improvingcontent-aware image filling using object recognition and 3Drepresentations is shown according to one embodiment. The method 600 maybe performed in accordance with the present invention in any of theenvironments depicted in FIGS. 1-3, among others, in variousembodiments. Of course, more or less operations than those specificallydescribed in FIG. 6 may be included in method 600, as would beunderstood by one of skill in the art upon reading the presentdescriptions.

Each of the steps of the method 600 may be performed by any suitablecomponent of the operating environment. For example, in variousembodiments, the method 600 may be partially or entirely performed byone or more servers, computers, or some other device having one or moreprocessors therein. The processor, e.g., processing circuit(s), chip(s),and/or module(s) implemented in hardware and/or software, and preferablyhaving at least one hardware component may be utilized in any device toperform one or more steps of the method 600. Illustrative processorsinclude, but are not limited to, a central processing unit (CPU), anapplication specific integrated circuit (ASIC), a field programmablegate array (FPGA), etc., combinations thereof, or any other suitablecomputing device known in the art.

As shown in FIG. 6, method 600 may initiate with operation 602, where auser selection of an area is received, where the area is within an imageand needs to be filled in based on its surroundings. In one embodiment,the user may make the selection utilizing an image editing application.In another embodiment, the user may make the selection in real-timeduring a display of the image.

Additionally, method 600 may proceed with operation 604, where an entitythat needs to be filled in within the selected area is identified,utilizing image recognition. In one embodiment, the image recognitionmay be performed utilizing one or more techniques (e.g., an imagerecognition service within the image editing application, etc.). Inanother embodiment, only entities that are poorly handled by other filltools (e.g., entities with an irregular structure, etc.) may beidentified to be filled in. In yet another embodiment, the entity may beidentified as matching an available 3D model within a database. Forexample, if the entity does not match an available 3D model within thedatabase, the entity may not be identified to be filled in.

Further, method 600 may proceed with operation 606, where exact contoursfor the entity are determined, utilizing edge detection. In oneembodiment, the contours of the entity may be provided (entirely orpartially) by the image recognition (e.g., an image recognitionalgorithm, etc.).

Further still, method 600 may proceed with operation 608, where 3Dmodels of the entity are retrieved from a model library. In oneembodiment, the 3D models may each include a wireframe model withoptional textures. Also, method 600 may proceed with operation 610,where a selection of the retrieved 3D models is narrowed down to asingle 3D model for the entity. In one embodiment, the selection may benarrowed utilizing geo tag data and a timestamp associated with theimage.

For example, the geo tag data and timestamp data associated with theimage may indicate that the image is a picture taken in Japan ten yearsago, and the entity may be identified as an automobile. In response, theretrieved 3D models may then be limited to models of automobiles with apredetermined level of popularity within Japan ten years ago. In anotherexample, the geo tag data and timestamp data associated with the imagemay help to determine time and location-appropriate clothing when theentity is identified as a person.

In addition, method 600 may proceed with operation 612, where atransformation is applied to the single 3D model to match a shape andviewing angle of the entity within the image. Furthermore, method 600may proceed with operation 614, where the 3D model is projected onto theselected area that needs to be filled, using a transformation. Furtherstill, method 600 may proceed with operation 616, where each surface ofthe 3D model is mapped to an area in the image. For example, when theentity is identified as an automobile, the surfaces may include one ormore windows, doors, wheels, etc.

Also, method 600 may proceed with operation 618, where a texture isapplied to each surface of the 3D model within the image area to befilled. In one embodiment, the texture may be learned from an existingtexture of the same surface outside of the fill area. In anotherembodiment, the texture may be learned from other images. In yet anotherembodiment, the texture may be associated with the 3D model.

Additionally, method 600 may proceed with operation 620, where acontext-aware fill is applied to all parts of the image around theentity within the selected area. Further, method 600 may proceed withoperation 622, where the filled image is validated. In one embodiment,the reconstructed image may pass through an image recognition service toevaluate a quality of the filled area.

In this way, content-aware image filling may be performed on areas of animage that include recognizable entities with irregular structures.

The present invention may be a system, a method, and/or a computerprogram product. The computer program product may include a computerreadable storage medium (or media) having computer readable programinstructions thereon for causing a processor to carry out aspects of thepresent invention.

The computer readable storage medium can be a tangible device that canretain and store instructions for use by an instruction executiondevice. The computer readable storage medium may be, for example, but isnot limited to, an electronic storage device, a magnetic storage device,an optical storage device, an electromagnetic storage device, asemiconductor storage device, or any suitable combination of theforegoing. A non-exhaustive list of more specific examples of thecomputer readable storage medium includes the following: a portablecomputer diskette, a hard disk, a random access memory (RAM), aread-only memory (ROM), an erasable programmable read-only memory (EPROMor Flash memory), a static random access memory (SRAM), a portablecompact disc read-only memory (CD-ROM), a digital versatile disk (DVD),a memory stick, a floppy disk, a mechanically encoded device such aspunch-cards or raised structures in a groove having instructionsrecorded thereon, and any suitable combination of the foregoing. Acomputer readable storage medium, as used herein, is not to be construedas being transitory signals per se, such as radio waves or other freelypropagating electromagnetic waves, electromagnetic waves propagatingthrough a waveguide or other transmission media (e.g., light pulsespassing through a fiber-optic cable), or electrical signals transmittedthrough a wire.

Computer readable program instructions described herein can bedownloaded to respective computing/processing devices from a computerreadable storage medium or to an external computer or external storagedevice via a network, for example, the Internet, a local area network, awide area network and/or a wireless network. The network may comprisecopper transmission cables, optical transmission fibers, wirelesstransmission, routers, firewalls, switches, gateway computers and/oredge servers. A network adapter card or network interface in eachcomputing/processing device receives computer readable programinstructions from the network and forwards the computer readable programinstructions for storage in a computer readable storage medium withinthe respective computing/processing device.

Computer readable program instructions for carrying out operations ofthe present invention may be assembler instructions,instruction-set-architecture (ISA) instructions, machine instructions,machine dependent instructions, microcode, firmware instructions,state-setting data, or either source code or object code written in anycombination of one or more programming languages, including an objectoriented programming language such as Smalltalk, C++ or the like, andconventional procedural programming languages, such as the “C”programming language or similar programming languages. The computerreadable program instructions may execute entirely on the user'scomputer, partly on the user's computer, as a stand-alone softwarepackage, partly on the user's computer and partly on a remote computeror entirely on the remote computer or server. In the latter scenario,the remote computer may be connected to the user's computer through anytype of network, including a local area network (LAN) or a wide areanetwork (WAN), or the connection may be made to an external computer(for example, through the Internet using an Internet Service Provider).In some embodiments, electronic circuitry including, for example,programmable logic circuitry, field-programmable gate arrays (FPGA), orprogrammable logic arrays (PLA) may execute the computer readableprogram instructions by utilizing state information of the computerreadable program instructions to personalize the electronic circuitry,in order to perform aspects of the present invention.

Aspects of the present invention are described herein with reference toflowchart illustrations and/or block diagrams of methods, apparatus(systems), and computer program products according to embodiments of theinvention. It will be understood that each block of the flowchartillustrations and/or block diagrams, and combinations of blocks in theflowchart illustrations and/or block diagrams, can be implemented bycomputer readable program instructions.

These computer readable program instructions may be provided to aprocessor of a general purpose computer, special purpose computer, orother programmable data processing apparatus to produce a machine, suchthat the instructions, which execute via the processor of the computeror other programmable data processing apparatus, create means forimplementing the functions/acts specified in the flowchart and/or blockdiagram block or blocks. These computer readable program instructionsmay also be stored in a computer readable storage medium that can directa computer, a programmable data processing apparatus, and/or otherdevices to function in a particular manner, such that the computerreadable storage medium having instructions stored therein includes anarticle of manufacture including instructions which implement aspects ofthe function/act specified in the flowchart and/or block diagram blockor blocks.

The computer readable program instructions may also be loaded onto acomputer, other programmable data processing apparatus, or other deviceto cause a series of operational steps to be performed on the computer,other programmable apparatus or other device to produce a computerimplemented process, such that the instructions which execute on thecomputer, other programmable apparatus, or other device implement thefunctions/acts specified in the flowchart and/or block diagram block orblocks.

The flowchart and block diagrams in the Figures illustrate thearchitecture, functionality, and operation of possible implementationsof systems, methods, and computer program products according to variousembodiments of the present invention. In this regard, each block in theflowchart or block diagrams may represent a module, segment, or portionof instructions, which includes one or more executable instructions forimplementing the specified logical function(s). In some alternativeimplementations, the functions noted in the block may occur out of theorder noted in the figures. For example, two blocks shown in successionmay, in fact, be executed substantially concurrently, or the blocks maysometimes be executed in the reverse order, depending upon thefunctionality involved. It will also be noted that each block of theblock diagrams and/or flowchart illustration, and combinations of blocksin the block diagrams and/or flowchart illustration, can be implementedby special purpose hardware-based systems that perform the specifiedfunctions or acts or carry out combinations of special purpose hardwareand computer instructions.

Moreover, a system according to various embodiments may include aprocessor and logic integrated with and/or executable by the processor,the logic being configured to perform one or more of the process stepsrecited herein. By integrated with, what is meant is that the processorhas logic embedded therewith as hardware logic, such as an applicationspecific integrated circuit (ASIC), a FPGA, etc. By executable by theprocessor, what is meant is that the logic is hardware logic; softwarelogic such as firmware, part of an operating system, part of anapplication program; etc., or some combination of hardware and softwarelogic that is accessible by the processor and configured to cause theprocessor to perform some functionality upon execution by the processor.Software logic may be stored on local and/or remote memory of any memorytype, as known in the art. Any processor known in the art may be used,such as a software processor module and/or a hardware processor such asan ASIC, a FPGA, a central processing unit (CPU), an integrated circuit(IC), a graphics processing unit (GPU), etc.

It will be clear that the various features of the foregoing systemsand/or methodologies may be combined in any way, creating a plurality ofcombinations from the descriptions presented above.

It will be further appreciated that embodiments of the present inventionmay be provided in the form of a service deployed on behalf of acustomer to offer service on demand.

While various embodiments have been described above, it should beunderstood that they have been presented by way of example only, and notlimitation. Thus, the breadth and scope of a preferred embodiment shouldnot be limited by any of the above-described exemplary embodiments, butshould be defined only in accordance with the following claims and theirequivalents.

1. A computer-implemented method, comprising: identifying an entitywithin an image that includes one or more missing portions; determininga three-dimensional (3D) model for the entity; and filling in the one ormore missing portions of the entity within the image, utilizing the 3Dmodel.
 2. The computer-implemented method of claim 1, wherein the entityis identified in response to: identifying a first attempt to fill in anarea of the image including the entity, utilizing a tool within an imageediting application for filling in regular structures; and detectingthat results of the first attempt have been undone by a user of theimage editing application.
 3. The computer-implemented method of claim1, further comprising correcting the image for distortion, wherein theone or more missing portions result from the correcting.
 4. Thecomputer-implemented method of claim 1, wherein the entity is identifiedutilizing one or more image recognition techniques that utilize localimage data to refine results.
 5. The computer-implemented method ofclaim 1, wherein metadata is associated with the entity as a result ofthe identifying, the metadata including a classification of the entityand one or more specific characteristics of the entity.
 6. Thecomputer-implemented method of claim 1, wherein determining the 3D modelincludes using edge detection to determine one or more contours of theentity within the image.
 7. The computer-implemented method of claim 1,wherein determining the 3D model includes retrieving a plurality ofdifferent 3D models from a model library.
 8. The computer-implementedmethod of claim 7, wherein the plurality of different 3D models areretrieved utilizing metadata associated with the entity.
 9. Thecomputer-implemented method of claim 7, wherein the 3D model is selectedfrom the plurality of different 3D models utilizing additional metadataassociated with the image, the additional metadata including a date andtime the image was created and a geographical location where the imagewas created.
 10. The computer-implemented method of claim 1, whereinfilling in the entity includes applying a transformation to the 3D modelfor the entity.
 11. The computer-implemented method of claim 10, whereinfilling in the entity includes using the transformation to project the3D model onto a selected area within the image that needs to be filled.12. The computer-implemented method of claim 1, wherein filling in theentity includes mapping each surface of the 3D model to the image. 13.The computer-implemented method of claim 12, wherein filling in theentity includes applying a texture to portions each surface that need tobe filled.
 14. A computer program product for filling in an entitywithin an image, the computer program product comprising a computerreadable storage medium having program instructions embodied therewith,wherein the computer readable storage medium is not a transitory signalper se, the program instructions executable by a processor to cause theprocessor to perform a method comprising: identifying the entity withinthe image that includes one or more missing portions, utilizing theprocessor; determining a three-dimensional (3D) model for the entity,utilizing the processor; and filling in the one or more missing portionsof the entity within the image, utilizing the 3D model and theprocessor.
 15. The computer program product of claim 14, wherein theentity is identified in response to: identifying a first attempt to fillin an area of the image including the entity, utilizing a tool within animage editing application for filling in regular structures; anddetecting that results of the first attempt have been undone by a userof the image editing application.
 16. The computer program product ofclaim 14, wherein the entity contains one or more missing or emptyportions as shown within the image that result from correcting the imagefor perspective or distortion.
 17. The computer program product of claim14, wherein the entity is identified utilizing one or more imagerecognition techniques that utilize local image data to refine results.18. The computer-implemented method of claim 1, wherein filling in theone or more missing portions of the entity within the image creates areconstructed entity and includes: applying a transformation to the 3Dmodel to manipulate the model so that it matches a shape and viewingangle of the entity, using the transformation to project the 3D modelonto the one or more missing portions of the entity, mapping eachsurface of the 3D model to the image, and applying a texture to the oneor more missing portions of the entity that need to be filled, where thetexture is learned from a texture of an adjacent surface outside of theone or more missing portions; wherein the reconstructed entity isvalidated by: performing image recognition on the reconstructed entityto obtain a first confidence score, comparing the first confidence scoreto a second confidence score previously obtained for the image, anddetermining a different 3D model for the entity in response todetermining that the first confidence score is not greater than thesecond confidence score.
 19. The computer-implemented method of claim 1,wherein the entity is identified utilizing one or more image recognitiontechniques, and results of implementing the one or more imagerecognition techniques include metadata including an identification ofthe entity, a classification of the entity, and one or more specificcharacteristics of the entity.
 20. A system, comprising: a processor;and logic integrated with the processor, executable by the processor, orintegrated with and executable by the processor, the logic beingconfigured to: identify an entity within an image that includes one ormore missing portions; determine a three-dimensional (3D) model for theentity; and fill in the one or more missing portions of the entitywithin the image, utilizing the 3D model.